AITopics | Charles County

M1: Chunking long documents into max-context-length chunks, and averaging all-mini-LM-v6 embeddings across chunks to produce a final document embedding.

artificial intelligence, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Maine > Cumberland County > Standish (0.14)
North America > United States > California (0.05)
Asia > India > Rajasthan (0.04)
(9 more...)

Industry:

Health & Medicine (1.00)
Education (0.93)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Add feedback

MetaSumPerceiver: Multimodal Multi-Document Evidence Summarization for Fact-Checking

Chen, Ting-Chih, Tang, Chia-Wei, Thomas, Chris

arXiv.org Artificial IntelligenceJul-17-2024

Fact-checking real-world claims often requires reviewing multiple multimodal documents to assess a claim's truthfulness, which is a highly laborious and time-consuming task. In this paper, we present a summarization model designed to generate claim-specific summaries useful for fact-checking from multimodal, multi-document datasets. The model takes inputs in the form of documents, images, and a claim, with the objective of assisting in fact-checking tasks. We introduce a dynamic perceiver-based model that can handle inputs from multiple modalities of arbitrary lengths. To train our model, we leverage a novel reinforcement learning-based entailment objective to generate summaries that provide evidence distinguishing between different truthfulness labels. To assess the efficacy of our approach, we conduct experiments on both an existing benchmark and a new dataset of multi-document claims that we contribute. Our approach outperforms the SOTA approach by 4.6% in the claim verification task on the MOCHEG dataset and demonstrates strong performance on our new Multi-News-Fact-Checking dataset.

computational linguistic, dataset, msp, (15 more...)

arXiv.org Artificial Intelligence

2407.13089

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Virginia (0.04)
North America > United States > Colorado > Denver County > Denver (0.04)
(11 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

D4: Improving LLM Pretraining via Document De-Duplication and Diversification

Tirumala, Kushal, Simig, Daniel, Aghajanyan, Armen, Morcos, Ari S.

arXiv.org Artificial IntelligenceAug-23-2023

Over recent years, an increasing amount of compute and data has been poured into training large language models (LLMs), usually by doing one-pass learning on as many tokens as possible randomly selected from large-scale web corpora. While training on ever-larger portions of the internet leads to consistent performance improvements, the size of these improvements diminishes with scale, and there has been little work exploring the effect of data selection on pre-training and downstream performance beyond simple de-duplication methods such as Min-Hash. Here, we show that careful data selection (on top of de-duplicated data) via pre-trained model embeddings can speed up training (20% efficiency gains) and improves average downstream accuracy on 16 NLP tasks (up to 2%) at the 6.7B model scale. Furthermore, we show that repeating data intelligently consistently outperforms baseline training (while repeating random data performs worse than baseline training). Our results indicate that clever data selection can significantly improve LLM pre-training, calls into question the common practice of training for a single epoch on as much data as possible, and demonstrates a path to keep improving our models past the limits of randomly sampling web data.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2308.12284

Country:

North America > United States > Maine > Cumberland County > Standish (0.14)
North America > United States > California (0.04)
Asia > India > Rajasthan (0.04)
(13 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Health & Medicine (1.00)
Education (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Schools deploy AI technology to protect against active shooters

FOX NewsMar-29-2023, 23:49:13 GMT

Fox News correspondent Matt Finn has the latest on the impact of AI technology that some say could outpace humans on'Special Report.' WASHINGTON – While most people look to artificial intelligence, or AI, for quick answers to complex problems, a growing number of school districts are turning to the technology to keep their students and staff safe. A school district in Charles County, Maryland, roughly an hour from Washington D.C., is in the process of installing software and hardware which would allow their current security cameras to detect a potential active shooter. "This artificial intelligence has the ability to be able to identify a weapon, to assess what's going on and how that person is acting," said Jason Stoddard, Director of School safety and Security for Charles County Public Schools. The district, through a state grant, is in the process of installing AI gun detection technology at all of its campuses.

active shooter, getty image, school deploy ai technology, (10 more...)

FOX News

Country:

North America > United States > Maryland > Charles County (0.47)
North America > United States > District of Columbia > Washington (0.26)
North America > United States > Tennessee > Davidson County > Nashville (0.06)
(3 more...)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Education > Health & Safety > School Safety & Security > School Violence (0.82)
Media > News (0.76)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.57)
Information Technology > Artificial Intelligence > Applied AI (0.40)

Add feedback

Toward Metric Indexes for Incremental Insertion and Querying

Raff, Edward, Nicholas, Charles

arXiv.org Machine LearningJan-12-2018

In this work we explore the use of metric index structures, which accelerate nearest neighbor queries, in the scenario where we need to interleave insertions and queries during deployment. This use-case is inspired by a real-life need in malware analysis triage, and is surprisingly understudied. Existing literature tends to either focus on only final query efficiency, often does not support incremental insertion, or does not support arbitrary distance metrics. We modify and improve three algorithms to support our scenario of incremental insertion and querying with arbitrary metrics, and evaluate them on multiple datasets and distance metrics while varying the value of $k$ for the desired number of nearest neighbors. In doing so we determine that our improved Vantage-Point tree of Minimum-Variance performs best for this scenario.

artificial intelligence, data mining, machine learning, (18 more...)

arXiv.org Machine Learning

1801.05055

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > District of Columbia > Washington (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
(9 more...)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.67)

Add feedback

Is This How We Keep People from Starving Once the Robots Take Over?

#artificialintelligenceSep-6-2016, 13:50:35 GMT

What are we going to do for all of the people displaced by robots? From truck drivers to (gasp!) writers, the growing number of professions impacted or even altogether eliminated by automation and artificial intelligence presents a significant potential economic and political challenge. Automation and artificial intelligence are a potential crisis, not an inevitable one. One idea is to find a way to guarantee a minimal standard of living, even in a world where, due to technology, the number of available jobs may be much smaller. One way is to implement a relatively old idea known as the guaranteed basic income.

artificial intelligence, basic income, job market, (15 more...)

#artificialintelligence

Country:

North America > United States > Missouri > St. Charles County (0.06)
North America > United States > Maryland > Charles County (0.06)
North America > United States > California > Alameda County > Oakland (0.06)

Industry: Transportation > Ground > Road (0.54)

Technology:

Information Technology > Artificial Intelligence > Robots (0.88)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.37)

Add feedback